libboilerpipe-java software package provides Boilerplate removal and fulltext extraction from HTML pages, you can install in your Ubuntu 14.04 LTS (Trusty Tahr) by running the commands given below on the terminal,
$ sudo apt-get update $ sudo apt-get install libboilerpipe-java
libboilerpipe-java is installed in your system.
Make ensure the libboilerpipe-java package were installed using the commands given below,
$ sudo dpkg-query -l | grep libboilerpipe-java *
You will get with libboilerpipe-java package name, version, architecture and description in a table
| Distro | Ubuntu 14 04 lts trusty tahr |
| Section | Java |
| Package | libboilerpipe-java |
| Version | 1.2.0-1 |
| Description | Boilerplate removal and fulltext extraction from HTML pages |
| Sub Section | Java |